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ABSTRACT 

We analyze the transmitted flux in a sample of 17 QSOs spectra at 5.74 < Ze,„ < 6.42 to 
obtain tighter constraints on the volume-averaged neutral hydrogen fraction, xhi, at 2 « 6. 
We study separately the narrow transmission windows (peaks) and the wide dark portions 
(gaps) in the observed absorption spectra. By comparing the statistics of these spectral fea- 
tures with a semi-analytical model of the Lya forest, we conclude that xhi evolves smoothly 
from lO^'''^ at z — 5.3 to lO^'*'^ at z = 5.6, with a robust upper limit xm < 0.36 at z = 6.3. 
The frequency and physical sizes of the peaks imply an origin in cosmic underdense regions 
and/or in HII regions around faint quasars or galaxies. In one case (the intervening HII re- 
gion of the faint quasar RD J1148H-5253 at z = 5.70 along the LOS of SDSS J1148H-5251 
at z — 6.42) the increase of the peak spectral density is explained by the first-ever detected 
transverse proximity effect in the HI Lya forest; this indicates that at least some peaks result 
from a locally enhanced radiation field. We then obtain a strong lower limit on the foreground 
QSO lifetime of > 11 Myr. The observed widths of the peaks are found to be systemati- 
cally larger than the simulated ones. Reasons for such discrepancy might reside either in the 
photoionization equilibrium assumption or in radiative transfer effects. 

Key words: cosmology; large-scale structure of Universe - intergalactic medium - quasars; 
absorption lines 



1 INTRODUCTION 

Although observations of cosmic epochs closer to the present have 
indisputably shown that the InterGalactic Medium (IGM) is in an 
ionized state, it is yet unclear when the phase transition from the 
neutral state to the ionized one started. Thus, the redshift of reion- 
ization, Zrei, is still very uncertain. 

After the first year WMAP data a possible tension was identi- 
fied between CMB and SDSS results. The high electron-scattering 
optical depth inferred from the TE-EE power spectra Te ~ 0.17 
(Kogut et al. 2003; Spergel et al. 2003) seemed difficult to be recon- 
ciled with the strong evolution in the Gunn-Peterson optical depth 
TGP at z = 6 (Fan et al. 2001; Fan et al. 2003), accompanied 
by the appearance of large dark portions in QSO absorption spec- 
tra (Becker et al. 2001; Djorgovski et al. 2001; Fan et al. 2006, 
hereafter F06). The 3-yr WMAP results have released the above 
tension by providing a smaller value for ~ 0.1, which im- 
plies Zrei ~ 11 for a modcl with instantaneous reionization (Page 
et al. 2006; Spergel et al. 2006). However, an increasing number 
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of LAEs are routinely found at 2: > 6 (e.g. Stern et al. 2005; lye 
et al. 2006; Stark et al. 2007), possibly requiring a substantial free 
electron fraction, resulting in an IGM relatively transparent to Lya 
photons. 

Constraints on the IGM ionization state derived by using Lya 
forest spectroscopy must take into account the extremely high sen- 
sitivity of TCP to tiny neutral hydrogen amounts. Indeed, a volume 
averaged neutral hydrogen fraction as low as .thi ^ 10~^ (Fan 
et al. 2002) is sufficient to completely depress the transmitted flux 
in QSO absorption spectra; thus, the detection of a Gunn-Peterson 
trough only translates into a lower limit for xm- For this reason, 
recently, many studies have tried to clarify if the SDSS data effec- 
tively require that the IGM was reionized as late as z ~ 6 (Gallerani 
et al. 2006, hereafter GCF06; Becker et al. 2006): in particular, 
GCF06 have shown that QSO observational data currently avail- 
able are compatible with a highly ionized Universe at that redshift. 

Clearly the determination of the reionization epoch is strictly 
related to the measurement of the neutral hydrogen fraction at 
« 6. To investigate this issue many different approaches can 
be used. To start with, it is worth mentioning that many authors 
have tried to constrain xui at high redshift by analyzing statisti- 
cally the optical depth inside HII regions around high redshift QSO 
(Mesinger & Haiman 2004; Mesinger & Haiman 2006), or by mea- 
suring the QSO bubble sizes (Wyithe & Loeb 2004; Wyithe et al. 
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Figure 1. Left panel: Evolution of the volume tilling factor of ionized regions for the eai'ly (red solid lines) and late (blue dotted lines) reionization models. 
Middle panel: Volume-averaged photoionization rate r_i2 = Fhi/IO^^'^s"^. The filled circles, empty circles, filled triangles and empty triangles show 
results obtained by F06. MMOl, B05 and B07, respectively. Right panel: Evolution of the neutral hydrogen fraction. Thick lines represent average results over 
100 LOS, while the thin lines denote the upper and lower neutral hydrogen fraction extremes in each redshift interval. Solid circles represent neutral hydrogen 
fraction estimates by F06; empty squares denote the results obtained in this work. 



2005; Bolton & Haehnelt 2007b; Maselli et al. 2007, Lidz et al. 
2007). However, sufficient ground for controversy remains due to 
intrinsic uncertainties of the various techniques. 

By deriving sizes of HII bubbles surrounding observed z — 
6.5 LAEs, Malhotra & Rhoads 2005 have provided an upper limit 
for the neutral hydrogen fraction xui ^ 0.2 — 0.5. This result is 
in quite good agreement with the upper limit xm ^ 0.45 found by 
Kashikawa et al. 2006, by interpreting the deficit measured at the 
bright end of the LAE Luminosity Function at z > 6 as a sudden 
change in the intergalactic neutral hydrogen content. Nevertheless, 
the increasing attenuation with redshift of the Lya line transmission 
could be partially explained as a consequence of the evolution in 
the mass function of dark matter halos, thus implying a much lower 
upper hmit, xm < 0.05 - 0.2 (Dijkstra et al. 2007). 

GRB spectroscopy has also tentatively used to constrain xuV, 
Totani et al. 2006 have observed a damping wing at wavelengths 
larger than the Lya emission line, finding that this feature can be 
explained at best by assuming an intervening damped Lya system 
immersed in a fully ionized IGM, and quoting an upper limit of 
Xm < 0.17 and 0.60 (68% and 95% confidence levels, respec- 
tively). 

Finally, the width distribution of dark portions (gaps) seen in 
QSO absorption spectra has been recently introduced in order to 
constrain the IGM ionization state (Paschos & Norman 2005; F06; 
GCF06). F06 has used the dark gap distribution, as observed in 19 
high-z QSO spectra, to put a preliminary upper limit on the IGM 
neutral fraction xm < 0.1 ~ 0.5. GCF06, by analyzing the sta- 
tistically properties of the transmitted flux in simulated absorption 
spectra, have shown that the gap and peak (i.e. transmission win- 
dows) width statistics are very promising tools for discriminating 
between an early (zrei > 6) and a late (zrei ~ 6) reionization sce- 
nario. Here we combine the previous two results: by comparing the 
observed transmitted flux in high-z QSO spectra with theoretical 
predictions we obtain tighter constraints on the neutral hydrogen 
fraction around 2 = 6, a crucial epoch in the reionization history. 

The plan of the paper is the following: in Section 2 we de- 
scribe the semi-analytical modeling adopted; in Section 3 we com- 
pare observational data with simulations. The implications of this 



comparison are given in Section 4; in Section 5 we evaluate the 
robustness of our method against the specific line of sight to the 
highest redshift QSO. The conclusions are summarized in Section 
6. 



2 SIMULATIONS 

The radiation emitted by QSOs could be absorbed through Lya 
transition by the neutral hydrogen intersecting the line of sight, the 
so-called Gunn-Peterson (GP) effect. The Lya forest arises from 
absorption by low amplitude-fluctuations in the underlying bary- 
onic density field (Bi, Borner & Chu 1992), and is a natural conse- 
quence of the hierarchical structure formation expected in the con- 
text of CDM cosmologies^. 

To simulate the GP optical depth (top) distribution we use the 
method described by GCF06, whose main features are recalled in 
the following. The spatial distribution of the baryonic density field 
and its correlation with the peculiar velocity field are taken into ac- 
count adopting the formalism introduced by Bi & Davidsen 1997. 
To enter the mildly non-linear regime which characterizes the Lya 
forest absorbers we use a Log-Normal model introduced by Coles 
& Jones 1991, widely adopted later on (Bi 1993; Bi & Davidsen 
1997; Choudhury, Padmanabhan, & Srianand 2001; Choudhury, 
Srianand, & Padmanabhan 2001; Viel et al. 2002; GCF06). In par- 
ticular, GCF06 have compared various Lya statistics, namely the 
Probability Distribution Function (PDF) and the Gap Width dis- 
tribution, computed using the Log-Normal distribution with those 

^ Throughout this paper we will assume a flat universe with total matter, 
vacuum, and baryonic densities in units of the critical density of f!m = 
0.24, Ha = 0.76, and fl,,h^ = 0.022, respectively, and a Hubble constant 
of Hq = lOQh km s^'^Mpc"^, with h = 0.73. The parameters defining 
the linear dark matter power spectrum are n = 0.95, dn/dlnfe = 0. 
erg = 0.82. Note that we have chosen a trg value higher than the WMAP3 
one (0.74). Indeed Viel et al. 2006, by combining WMAP3 data with SDSS 
ones, found erg ss 0.78 (0.86) analyzing high (low) resolution Lyo forest 
data. Mpc are physical unless differently stated. 
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Figure 2. Left panel: Evolution of the Gunn-Peterson optical depth for early (ERM, solid red line) and late (LRM, blue dotted). Thick lines represent average 
results on 100 LOS for each emission redshift, while the thin lines denote the upper and lower transmission extremes in each redshift bin, weighted on 100 
LOS. Filled and empty circles are observational data from Songaila 2004 and F06, respectively. Middle panel: Probability Distribution Function (PDF) of the 
flux at z=6.0. Filled circles are obtained by Fan et al. 2002. Thick lines represent simulated results averaged over 500 LOS, while the thin lines denote cosmic 
vaiiance. Right panel: Gap Width distribution in the redshift range 5.0-5.5. Simulated results are compared with observations by Songaila & Cowie 2002 (filled 
circles). The errors associated to both simulated and observed results denote cosmic vaiiance. 



obtained from HYDROPM simulations, finding a good agreement 
between the results. For a given IGM temperature, the HI fraction, 
xm, can be computed from the photoionization equilibrium as a 
function of the baryonic density field and photoionization rate due 
to the ultraviolet background radiation field. For all these quantities 
we follow the approach of Choudhury & Ferrara 2006, hereafter 
CF06. By assuming as ionizing sources QSOs, PopII and PopIII 
stars (the latter neglected here, see below), their model provides 
excellent fits to a large number of observational data, namely the 
redshift evolution of Lyman-limit systems, Lya and Ly/3 optical 
depths, electron scattering optical depth, cosmic star formation his- 
tory, and the number counts of high redshift sources. In the CF06 
model, a reionization scenario is defined by the product of two 
free parameters: (i) the star-formation efficiency /*, and (ii) the 
escape fraction fesc of ionizing photons of PopII and PopIII stars; 
it is worth noting that these parameters are degenerate, since dif- 
ferent parameter values could provide equally good fits to observa- 
tions. In this work, by fitting all the above observational constraints, 
we select two sets of free parameters values yielding two different 
reionization histories: (i) an Early Reionization Model (ERM) for 
(f*. PopII ~ 0.1; fesc, PopII = 0.07), and (ii) a Late Reionization 
Model (LRM) for (,f,,popii = 0.08; fesc,Popii = 0.04). We do 
not consider contributions from PopIII stars, as PopII stars alone 
yield = 0.07 (0.06) for ERM (LRM), marginally consistent 
withWMAP3 results^ 

Fig. 1 shows the global properties of the two reionization models 
considered. In the ERM the volume filling factor of ionized regions, 
Qhii = Vhii /Vtot = 1 at 2 < 7; in the LRM it evolves from 0.65 
to unity in the redshift range 7.0-6.0, implying that the Universe is 
still in the pre-overlap stage at z > 6, i.e. the reionization process 
is not completed up to this epoch. In the middle panel of the same 
Figure we compare the volume-averaged photoionization rate Fhi 



for the two models with the recent estimate by F06, and the ones 
by McDonald & Miralda-Escude' 2001, Bolton et al. 2005, and 
Bolton & Haehnelt 2007a, hereafter MMOl, B05 and B07, respec- 
tively. Finally, the evolution of the volume-averaged neutral hydro- 
gen fraction for the ERM and LRM is presented in the rightmost 
panel. 

The photoionization rate predicted by both models is in agree- 
ment with the results by B05 and B07 at in the range z = 4.0 < 
z < whereas at z = 5.5 (6) the ERM is characterized by a pho- 
toionization rate which is « 2 (6) times larger than the estimates by 
F06. In spite of these differences, our predictions for khi are con- 
sistent with F06 measurements. This apparent contradiction does 
not come as a surprise. In fact, the derivation of Fhi requires an 
assumption concerning the IGM density distribution. When mea- 
suring Fhi at 5 < z < 6, F06 assume the density Probability 
Distribution Function given by Miralda-Escude' et al. 2000, here- 
after MHR^. We instead adopt a Log-Normal (LN) model which 
predicts a higher probability to find overdensities A — p/p > 1 
than MHR. For example, at 2 = 6 and for A « 1.5, Pljv(A) ^ 
2 X Pmhr(A)). For this reason, once tqp is fixed to the observed 
value, the LN model requires a higher Fhi. As xhi oc A, these two 
effects combine to give a values of xm consistent with the data. 



3 COMPARISON WITH OBSERVATIONS 
3.1 Control statistics 

We first test the predictions of our model by applying various sta- 
tistical analysis to the simulated spectra and comparing our results 
with observations. Specifically, we use the following control statis- 
tics: (i) Mean Transmitted Flux evolution in the redshift range 2 — 6; 



^ Small contributions from PopIII stars, i.e. f*,Poplll = 0.013 
(f*,Poplll = 0.08), in the ERM (LRM), would yield re = 0.09 
(Te = 0.08), without affecting sensitively the results below. 



F06 require Fjji to match the MMOl measurement at 2; = 4.5. This 
estimate is based on a mean transmitted flux {F = 0.25) which is lower 
than the more recent measurements F 0.32 by Songaila 2004, which 
implies Fhi ~ 0.3. 
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Figure 3. Largest Gap Width distribution for the LR and the HR cases (left and right, respectively). Filled circles represent the result of the analysis of the 17 
QSOs observed spectra. Solid red (dotted blue) lines show the results obtained by the semi-analytical modeling implemented for the ERM (LRM). Vertical 
error bars measure poissonian noise, horizontal errors define the bin for the gap widths. 



(ii) Probability Distribution Function (PDF) of the transmitted flux 
at the mean redshifts z = 5.5, 5.7, 6.0; (iii) Gap Width (GW) dis- 
tribution in 3.5 < z < 5.5. For what concerns the GW statis- 
tics we define gaps as contiguous regions of the spectrum having 
a TCP > 2.5 over rest-frame wavelength {\rf) intervals > 1 A. 
This method was first suggested by Croft (1998) and then adopted 
by various authors (Songaila & Cowie 2002; Paschos & Norman 
2005; F06; GCF06). The comparison of model and observational 
results in terms of the above three statistics is plotted in Fig. 2. By 
checking our models we follow the same approach of GCF06, to 
which we refer for a complete description of the technical details. 

The outcome of the test is encouraging, as both ERM and 
LRM successfully match the observational data at z < 6 for the 
control statistics considered. This allows us to confidently proceed 
the comparison with more advanced statistical tools. 

3.2 Advanced statistics 

Since at 2 ~ 6 regions with high transmission in the Lya forest 
become rare, an appropriate method to analyze the statistical prop- 
erties of the transmitted flux is the distribution of gaps. In partic- 
ular GCF06 suggested that the Largest Gap Width (LGW) and the 
Largest Peak Width (LPW) statistics are suitable tools to study the 
ionization state of the IGM at high redshift*. The LGW (LPW) dis- 
tribution quantifies the fraction of LOS which are characterized by 
the largest gap (peak) of a given width. As far as this work is con- 
cerned, we apply the LGW and the LPW statistics both to simu- 
lated and observed spectra with the aim of measuring the evolution 
of x^i with redshift. 

We use observational data including 17 QSOs obtained by F06. 

* The definition of "peak" in the transmitted flux is similar to the "gap" 
one. A peak is a contiguous region of the spectrum over \rf intervals 
greater than the observed pixel size (fa 0.5 A) characterized by a transmis- 
sion above a given flux threshold (-F^jj = 0.08 in this work). 



We divide the observed spectra into two redshift- selected sub- 
samples: the "Low-Redshift" (LR) sample (8 emission redshifts 
5.7 < Zem < 6), and the "High-Redshift" (HR) one (9 emission 
redshifts 6 < Zem < 6.4). Simulated spectra have the same Zem 
distribution of the observed samples. For most QSOs we consider 
the (Xrf) interval 1026-1200 A and we normalize each width to 
the corresponding redshift path. Note that the LOS do not extend 
up to Zem ; the upper (lower) limit of the interval chosen ensures that 
we exclude from the analysis the portions of the spectra penetrat- 
ing inside the QSO HII (Ly/3) region. For the QSOs SDSS J1044- 
0125 and SDSS J1048-I-4637 we choose different intervals, namely 
1050-1183 and 1050-1140, respectively. These two objects have 
been classified as BAL QSO (Goodrich et al. 2001; Fan et al. 2003; 
Maiolino et al. 2004), since their spectra present Broad Absorp- 
tion Lines associated with highly ionized atomic species (e.g. SilV, 
CIV). By selecting the above intervals we exclude those portions of 
the spectra characterized by CIV absorption features which extend 
to z fa 5.56 {z fa 5.75) in SDSS J1044-0125 (SDSS J1048-^4637). 
Observed data were taken with a spectral resolution R ~ 3000 — 
6000; simulated spectra have been convolved with a gaussian of 
FWHM — 67 km/sec, providing R ~ 4500. Moreover each 
observed/simulated spectrum has been rebinned to a resolution of 
R = 2600. Finally, we add noise to the simulated data such that the 
flux F in each pixel is replaced by _F + G(l) x (j„, where G{1) is 
a Gaussian random deviate with zero mean and unit variance, and 
(Jn is the observed noise r.m.s deviation of the corresponding pixel. 

The results provided by the statistics adopted in this study 
are sensitive to the S/N ratio, since spurious peaks could arise in 
spectral regions with noise higher than the Fth adopted. Indeed, 
the shape of the LGW/LPW distributions depends on the Fth cho- 
sen. Thus, we consider two different values for Fth, namely 0.03 
and 0.08, respectively, and, for both of them, compute preliminary 
LGW/LPW distributions. Finally, the LGW/LPW distributions pre- 
sented are obtained as the mean of the preliminary ones, weighted 
on the corresponding errors (See Appendix A for a detailed dis- 
cussion). In our analysis we do not consider 2 QSOs presented 
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Figure 4. Largest Peak Width distribution for the LR and the HR cases (left and right, respectively). Filled circles represent observational data obtained by 
analyzing the observed spectra of the 17 QSOs considered. Solid red (dotted blue) lines show the results obtained by the semi-analytical modeling implemented 
for the ERM (LRM). Vertical en'or bars measure poissonian noise, horizontal errors define the bin for the peak widths. 



by F06, namely SDSS J1436+5007 anA SDSS J1630+4012, siiice 
these spectra have significantly lower S/N to apply LGW/LPW tests 
(continuum S/N < 7). 



3.2.1 LG W distribution 

We now discuss the LGW distribution for observed/simulated spec- 
tra; the results are shown in Fig. 3. The QSOs emission redshifts 
used and the Xrf interval chosen for the LR sample are such that 
the mean redshift of the absorbers is {z} = 5.26, with a mini- 
mum (maximum) redshift Zmin ~ 4.69 (z,nax = 5.86), and a 
r.m.s. deviation a — 0.06. For the HR sample it is {z) = 5.55, 
Zmin = 4.90, Zmax = 6.32, (7 = 0.14. The observed LGW dis- 
tribution evolves rapidly with redshift: in the LR sample most of 
the LOS are characterized by a largest gap < 40 A, whereas gaps 
as large as 100 A appear in the HR sample. This means that LOS 
to QSOs emitting at z^m ^ 6 encounter "optically thick" regions 
whose size is < 20 Mpc, while for Zem ^ 6 blank regions of size 
up to 46 Mpc are present. 

Superposed to the data in Fig. 3 are the predicted LGW distri- 
butions corresponding to ERM and LRM, obtained by simulating 
800/900 LOS in the LR/HR case, corresponding to 100 LOS for 
each emission redshift in each sample. In our ERM simulated spec- 
tra, at z « 6 gaps are produced by regions characterized by a 
mean overdensity A ~ 1 (A,nin = 0.05, Amax = 18) with a 
xui ~ 10""', averaging on 100 LOS (xni.min = 1-1 x 10"'', 

2;HI,max = 3.6 X 10"''). 

It results that both the predicted LGW distributions provide a good 
fit to observational data. We exploit the agreement between the sim- 
ulated and observed LGW distributions to derive an estimate of 



By applying the same method to the HR sample we constrain the 



-4.2 



+0.84 



at 



xm, shown in Fig. 1. We find logj^Q xui ~ —4.4 



+0.84 



at 2 « 5.3^ 



^ The Xm value quoted is the mean between the estimates predicted by the 
ERM and the LRM. Moreover, we consider the most conservative case in 
which the errors for the measurement of the neutral hydrogen fraction are 



neutral hydrogen fraction to be within logjQ xm 
z^5.6. 

Although the predicted LGW distributions are quite similar for the 
two models considered, yet some differences can be pointed out. 
Both for the LR and HR cases the early reionization LGW distribu- 
tion provides a very good match to the observed points, thus sug- 
gesting Zrei 7. The agreement is satisfactory also for the LRM, 
but it is important to notice that late reionization models predict too 
many largest gaps ~ 60 A in the LR case and too few gaps ~ 20 A 
in the HR one. Given the limited quasar sample available, the statis- 
tical relevance of the LRM discrepancies is not sufficient to firmly 
rule out this scenario. However, since in the HR case 40% of the 
lines of sight extend at z > 6, we can use the LRM results to put an 
upper limit on xm at this epoch. Indeed in the HR case we find that 
a neutral hydrogen fraction at z; ~ 6 higher than that one predicted 
by the LRM would imply an even worst agreement with observa- 
tions, since a more abundant HI would produce a lower (higher) 
fraction of LOS characterized by the largest gap smaller (higher) 
than 40 A with respect to observations. Thus, this study suggests 
Xm < 0.36 at z — 6.32 (obtained from the maximum value for 
Xm found in the LRM at this epoch). 



3.2.2 LPW distribution 

Next, we apply the Largest Peak Width (LPW) statistics (Fig 4) to 
both observed and simulated spectra. From the observed LPW dis- 
tribution we find that, in the LR (HR) sample, about 50% of the 
lines of sight exhibit peaks of width < 12(8) A. In more details, 
the size Pmax of the largest transmission regions in the observed 
sample are 3 < Pmax < 10 (1 < Pmax < 6) Mpc at (z) = 5.3 
(5.6). The frequency and the amplitude of the transmission regions 



provided by the minimum xm value found in the ERM and the maximum 
one in the LRM. 
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rapidly decrease toward high redshift. This could be due both to 
the enhancement of the neutral hydrogen abundance at epochs ap- 
proaching reionization or to evolutionary effects of the density field 
(Songaila 2004). In fact the growth factor D+ of density fluctua- 
tions decreases with redshift (Z?+ (z = 6) = 3/5x D-f- {z = 3) for 
ACDM), thus implying a low density contrast at z = 6 with re- 
spect to later epochs. Stated differently, underdense regions that are 
transparent at z = 3, were less underdense atz = 6, thus blocking 
transmission. As a consequence of the density field evolution to- 
ward higher z, only few/small peaks survive and wide GP troughs 
appear. 

Superposed to the data in Fig. 4 are the predicted LPW distri- 
butions corresponding to ERM and LRM, obtained by simulating 
800/900 LOS in the LR/HR case. In our ERM simulated spectra, 
at 2; « 6, gaps are interrupted by narrow transparent windows 
(i.e. peaks) originating from underdense regions with A « 0.1, 
averaging on 100 LOS (Amin = 0.03, Amax = 0.26) and 
ajHi ~ 2 X W'\ (xHi,min = 7.8 X 10-6, ajHi.max = 3.6 X 10-5). 
Regions characterized by A G [0.05; 0.26] and xm G [1-1 x 
IQ-^; 3.6 X 10^"'] could correspond to both gaps or peaks depend- 
ing on redshift and peculiar motions of the absorbers producing 
them. 

By comparing the simulated LPW distributions with the ob- 
served one, it is evident that simulations predict peak widths that 
are much smaller than the observed ones both for LR and HR cases. 
In particular, in no LOS of our simulated samples we find peaks 
larger than 8 A. The disagreement between the observed and simu- 
lated LPW distributions does not affect the estimate of xm through 
the LOW distributions, since at high redshift the peaks are narrow 
(< 10 A). We discuss the possible reasons for this discrepancy in 
the final Section. 



4 PHYSICAL INTERPRETATION OF THE PEAKS 

The most natural interpretation for the peaks is that they correspond 
to underdense regions, where the low HI density of the gas allows 
a high transmissivity. However, in principle they could also arise if 
individual ionized bubbles produced by QSOs and/or galaxies are 
crossed by the LOS. In the latter case the typical physical size and 
frequency of such semi-transparent regions must be related to the 
emission properties and masses of such objects. Stated differently, 
the fraction of LOS, Jlos, having the largest peak width equal to 
Pmax can be interpreted as the probability p to intersect an HII 
region of radius Rhii around a dark matter halo hosting either a 
QSO or a galaxy along the redshift path (zi — zf) spanned by the 
LOS. The comoving number density Uh of dark matter halos of 
mass Mh is related to p through the following equation: 



,3/2 ,2/ 



(1) 



Z c 

We take Rhii = 1(10) Mpc, consistent with the smaller (larger) 
size Pmax of the observed largest peaks in the HR (LR) sample. 
As it is likely that statistically the LOS crosses the bubble with 
non-zero impact parameter, adopting Rhii = Pmax seems a rea- 
sonable assumption. By further imposing p = fios we find that 
Uh = 3.7 X 10-6 (2.2 X 10-*) Mpc-^ for Pmax = 1 (10) Mpc 
in the redshift range Zi = 5 to z/ = 6. Given our cosmology, 
such halo number density can be transformed at z = 5.5 into a 
typical halo mass of Mh > 10^^ (lO") M© (Mo & White 2002). 
Thus, the halos hosting the putative luminous sources producing the 
peaks must be massive. Note that this result holds even if the QSO 



is shining only for a fraction of the Hubble time tg /tn ~ 10 ^ at 
z = 5.5. 

In addition to the peak frequency, additional constraints on the 
properties of the ionizing sources come from bubble physical sizes. 



4.1 QSO HII regions 

First, we consider the case in which the largest peaks are produced 
by HII regions around QSOs. The bubble size Rhii is related to 
the ionizing photons emission rate N-^ and QSO lifetime tq as 

1/3 



Rhii ~ 



V 47rnHi 



(2) 



where nni is the neutral hydrogen number density. Eq.(2) applies 
for a homogeneous IGM and does not take into account both re- 
combinations and relativistic effects. 
The recombination timescale tree is given by: 



tre 



[CasnH(l - a:Hi) 



(3) 



where C ~ 26.2917ea;p[-0.1822z-f0.003505«^] is the clumping 
factor (Iliev et al. 2007), as = 2.6 x 10-^'"* cm''* s"^ is the case B 
hydrogen recombination coefficient evaluated at T = lO** K, and 
riH = 7 x 10-5[(1 -I- z)/7]^cm~^ is the mean hydrogen number 
density. Thus, at the redshifts of interest x « 6, even in the limiting 
case Xm = 0, Uec 2 x lO^yr, thus being larger than typical 
QSOs lifetime tg ~ 10^. This shows that cq.(2) provides a plau- 
sible value for the HII region extent. For instance, Maselli et al. 
(2007) have shown that eq.(2) matches quite well the mean value 
of the HII region size determined through radiative transfer calcu- 
lations. 

In this Section, we neglect relativistic effects which could squash 
the ionization front along the sight-line (White et al. 2003; Wyithe 
et al. 2005b; Yu 2005; Shapiro et al. 2006), possibly reducing the 
length of the lines of sight interested by the proximity effect. We 
will discuss this issue in detail in Sec. 5, when addressing the first 
observed case of transverse proximity effect. 

Atz = 5.5, assimiing xhi = 5.6 x 10-^ (see Fig. 1), Rhii = 
1 (10) Mpc could be produced by a QSO emitting a number of 
ionizing photons iV-, = N-ytg = 7 x lO^"^ (7 x 10^*). Thus, 
assuming a QSO lifetime « lO^yr, the observed peaks in the LR 
(HR) sample require N-y = 2.2 x lO''^ (2.2 x lO''''') which 
would correspond to sources « 6 (3) orders of magnitude fainter 
than QSOs observed at z « 6, typically having Nj fs lO^^'s-^ 
(Haiman & Cen 2002) and black hole masses Mbh ~ 10^Mc5. 

So far we have assumed that the gas inside the HII region is 
fully ionized or, stated differently, that along the redshift path en- 
compassed by the ionized bubble the flux is completely transmitted. 
However, this is unlikely since a sufficiently high opacity due to 
resonant (damping wing) optical depth associated with the neutral 
hydrogen inside (outside) the HII region can produce dark gaps. 
Thus, the relation between Pmax and Rhii is 



p — 

^ max — 



H{z)XLyc. (1 

C (1 + 



-ftRnii = A{z)ftRHii: 



(4) 



where ft is the mean transmitted flux computed inside the prox- 
imity region. We will derive ft in Sec. 5 from an observed case of 
transverse proximity effect, note that values of /t < 1 would re- 
sult in a larger luminosity of the QSO producing the transmissivity 
window. 

Finally, powerful QSOs, as those observed at z « 6, could produce 
transmission windows consistent with observational data if they are 
embedded in overdense regions where the high density sustains an 
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initial neutral fraction, xm ^ 0.1, before the QSO turns on. The 
expansion of the HII region in such environment would result in 
considerably smaller sizes (Maselli et al. 2007). In this case, both 
the host dark matter halo mass found above {Mh « lO^^M©), and 
the size of the HII region would combine to give the correct fre- 
quency and spectral width of the observed peaks. 



4.2 Galaxy HII regions 

In addition to QSOs, transmissivity windows could be produced by 
HII regions around high-« galaxies. Adopting the canonical rela- 
tions 

M.^f.^Mh- N^ = nj — ; f.soN-, = ^nmR%n, (5) 

where M, is the stellar mass, n-y is the number of ionizing pho- 
tons per baryon into stars, and rup is the proton mass, the relation 
between and Rhii is given by: 

Mfe = 3 X 10* Mq (^-^\ V-iRhii, (6) 



6.5 

where y^i = {xmfr^fesc)/0-^ and we are assuming n-y — 4000, 
appropriate for a PopII stellar population with a standard Salpetcr 
IMF; we assume the fiducial values xui = 5.6 x 10~^, /* = 0.1, 
fesc = 0.01. The mass of an halo hosting a star-forming region 
able to produce Rhii ^ 1 (10) Mpc is 2 x 10® (2 x 10^^) Mq. At 
z ~ 5.5 objects of these masses corresponds to fluctuations of the 
density field > 1 - a (2 - cr) (Barkana & Loeb 2001). 
As for QSOs, the bubble size— peak frequency tension could be al- 
leviated if the galaxies live in overdense environments where the 
photoionization rate only supports a xui ~ 0.1 (resulting in a 
larger value of and hence of Mh in eq.(6)) prior to the onset 
of star formation in the galaxy. Obviously, the previous arguments 
neglect that because of clustering (Yu & Lu 2005; Kramer et al. 
2006), as multiple sources could power a single HII region; in or- 
der to get firmer results radiative transfer cosmological simulations 
are required. 



5 PEAKS FROM THE PROXIMITY EFFECT 

In Sec. 4, we have discussed the possibility that the observed peaks 
are produced by ionizing sources whose bubbles intersect the lines 
of sight to the target QSO. In this case one could ask if the source 
responsible for the HII region would be detected in the observed 
field. If the origin of transmissivity regions resides in bubbles 
around high-« galaxies, these sources are too faint to be seen in 
the SDSS; however, deep HST imaging (Stiavelli et al. 2005) could 
detect such objects. On the contrary, if the HII region of a quasar in- 
tervenes along the LOS to an higher redshift quasar, the first could 
be observed in the SDSS field. 

Mahabal et al. 2005 have discovered a faint quasar (RD 
Jl 148-1-5253, hereafter QSOl) at z = 5.70 in the field of the high- 
est redshift quasar currently known (SDSS J 1 148-1-5251, hereafter 
QS02) at 2: = 6.42. In this Section we study the QS02 transmit- 
ted flux, in order to analyze the proximity effect of QSOl on the 
QS02 spectrum. For clarity. Fig. 5 presents a schematic picture of 
the considered geometry. As the redshift z = 5.70 quoted by Ma- 
habal et al. 2005 is based on the peak of the Lya emission line, and 
the estimated error from such procedure is Az « 0.05 (Goodrich 
et al. 2001), we assume z^°^ = 5.65 and we discuss this issue in 
further details in Appendix B. 



The two QSOs have a projected separation of 109", which cor- 
responds to R± = 0.66 Mpc. The line of sight to QS02 intersects 
the bubble produced by QSOl for a redshift path (Azprox) whose 
length depends on the radius of the HII region (Rhii) itself. We 
find Rhii = 39 Mpc, by plugging in eq.(2) the following values: 
tQ = 1.34 x lOV. XHi = 8.4 X 10-^ Nj = 8.6 x lO^hec-^, 
where xbi is provided by the mean value between those predicted 
by our models atz = 5.7 (see rightmost panel of Fig. 1), while N-y 
is compatible with the luminosity of a QSO 3.5 magnitudes fainter 
than QS02 (Mahabal et al. 2005). In this Section, we also take into 
account relativistic effects which could squash the ionization front 
along the LOS (White et al. 2003; Wyithe et al. 2005b; Yu 2005; 
Shapiro et al. 2006). The apparent size of the HII region, computed 
following the method outlined in Yu 2005, is shown in Fig. 5. By 
zooming the region close to QSOl (small box in Fig. (5)) it is clear 
that the apparent size of the HII region extends up to 2 Mpc in 
the direction toward QS02. Given Rhii, taking into account rel- 
ativistic effects, the region Azprox extends from z — 5.16 up to 
z = 5.68. We re-compute xui along the LOS to QS02, adding 
to the uniform UVB photoionization rate Fhi the photoionization 
rate P^f provided by QSOl, obtained starting from the follow- 
ing equations: 



167r2il2 ' 



POO po 



(7) 



(8) 



(9) 



where ^hi is the hydrogen photoionization frequency threshold, ctq 
is the Thompson scattering cross section, R is the distance from 
QSOl to the LOS, Nv^^ is the rate of the emitted ionizing photons 
at the hydrogen photoionization frequency threshold and a = 1.5 
is the spectral index of the QSO continuum. Integrating eq.(9) we 
obtain : 



(a - l)iV^ 



Thus, it results: 



a — 1 \ Njao 



a + 2j 47r J?2 ' 



(10) 



(11) 



In Fig. 6 we compare the observed transmitted flux in the spec- 
trum of QS02 with the simulated fluxes along 3 different LOS 
with (bottom row) or without (top) including the contribution from 
QSOl to the total ionizing flux. For brevity, we refer to these case 
as "with bubble" or "without bubble". Visual inspection of Fig. 6 
shows that the case "with bubble" is in better agreement with ob- 
servations. Such statement can be made more quantitative by in- 
troducing a quantity denoted Peak Spectral Density (PSD), i.e. the 
number of peaks per unit Xrf interval. To compute the PSD for 
the two cases, we fix two different values for the flux threshold in- 
side (Ft'i^ = 0.01) and outside (F^^y^ = 0.08) the bubble. While 
Ffh^'^ is the same as the value used in this work so far, F-^^ has 
been chosen accordingly to the maximum observed noise r.m.s. de- 
viation (for reasons explained in the Appendix A) in the A rf in- 
terval AA = 1087 - 1092 A, where rgf °^ > Phi. For both the 
observed and simulated spectra, we compute the PSD inside and 
outside the bubble, finding the following results: 

{PSD°,r,PSDig) = (0.11,0.40); 



8 S. Gallerani, A. Ferrara, X. Fan, T. Roy Choudhury 




-60 -40 -20 20 40 ^ 

% 1050 1150 1050 1150 1050 1150 1050 1150 



Figure 5. Schematic positions of quasai's SDSS Jl 148+5251 (QS02, 
= 6.42, redshift position not in scale) and RD J1148+5252 (QSOl, 
Zem = 5.65). The projected separation is denoted by R±, the size of the 
HII region, Rhii, in the QSOl rest frame is represented by the dotted cir- 
cle; the magenta solid line shows the apparent shape of the ionization front; 
the dashed black line shows the redshift path ( A^proi) in which the bubble 
produced by QSOl intersects the LOS to QS02. 



{PSD?^J,PSDlf^) = (0.04lg;°t0.24l°:32|), 

where error bars provide the maximum and minimum PSD values 
found in the simulated LOS. Observationally, the PSD is found to 
be ~ 4 times'" larger inside that bubble than outside it. This boost 
is quite well reproduced by the simulated PSD, although their ab- 
solute values are somewhat lower than the observed ones. 

The physical interpretation of the results reported in this Sec- 
tion is the following. In the \rf (redshift) interval AA = 1087 — 
1092 (Azdet = 5.63 - 5.67), where rgf°^ > Thi, most of the 
gaps present in the case "without bubble" disappear, making room 
for peaks, as a consequence of the decreased opacity in the prox- 
imity of QSOl. Note that the Fhi value adopted in our calcula- 
tions (shown in Fig.l, middle panel, red line) is close to the max- 
imum value suggested by previous studies. Moreover, F^j^''^ ~ 
3.4 X Fhi at 0.66 Mpc from the foreground QSO. Thus, an implau- 
sible Fhi value should be assumed to explain the observed boost 
in the PSD with a uniform UVB. The enhancement in the transmis- 
sivity decreases for \rf smaller (larger) than 1087 (1092) A, since 
at the corresponding redshift F^j^*^^ < Fhi. These results (i) con- 
firm the detection of a proximity effect, (ii) show that the redshift 
stretch affected by the proximity effect is Azdat < Azprox- 

As a final test for our model, we compute the observed evo- 
lution of the optical depth as a function of the distance R from 
QSOl and compare it with the predictions of model "with bubble"; 

^ This factor depends on the flux threshold used. For example, it is reduced 
to 2.7 if F^^ = F°^'^ = 0.05. Nevertheless, for the puipose of 
our test what really matters is the boost of this factor moving from outside 
toward inside the bubble. 



Figure 6. Leftmost panels: Observed transmitted flux (black spectra) in the 
spectrum of QSO SDSS J1148-I-5251 (QS02, Zem = 6.42). The solid 
black line shows the redshift path {Azprox) in which the bubble produced 
by QSO RD J1148+5252 (QSOl, z^m = 5.65) intersects the LOS to 
QS02. Top panels (at), with i=l,3: Simulated fluxes (cyan spectra) along 
3 different random LOS (cases "without bubble"). Bottom panels (hi), with 
i=l,3: Simulated fluxes (magenta spectra) along the same LOSs shown in 
the top panels, taking into account the contribution from QSOl to the total 
ionizing flux (cases "with bubble"). 



the result is shown in Fig. 7. The agreement between observations 
and simulations is at l-a confidence level for 86% of the plotted 
points. For R < 2 Mpc, the mean optical depth 1.5 < t < 3.5 
is lower than the mean value expected at z = 5.65 (fs.ss ~ 4); it 
approaches fs.es at distances larger than Rt ~ 2 Mpc. 
By taking the difference between Rr and R± , we set a lower limit 
on the foreground QSO lifetime tg > _(_ (^t^ _ tQsoi) ~ 

11 Myr, where It and Iqsoi represent the cosmic times corre- 
sponding to the redshifts Zt = 5.68 and z^*^^ = 5.65, respec- 
tively. 

It is worth noting that our model does not take into account neither 
(i) the clustering of the ionizing sources, nor (ii) the overdense envi- 
ronment expected around the QSO. Both these effects, in principle, 
could strongly affect the IGM ionization state, albeit in opposite 
ways. While clustering of sources would enhance the transmissiv- 
ity in the QSO near-zones, the overdense environment would tend 
to suppress it. The fact that we found agreement between observa- 
tions and our modeling could indicate that, at least along this LOS, 
the two effects compensate. For what concerns (ii), by comparing 
the optical depth evolution observed in the proximity regions of 45 
QSOs at Zem ^ 4 with theoretical expectations, Guimaraes et al. 
2007 find evidence for a density bias correlated with the QSO lu- 
minosity. Since QSOl is much fainter than the QSOs studied by 
Guimaraes et al. 2007 it seems likely that neglecting such effect 
does not introduce a significant error. However, the extension of 
the proposed approach to a larger sample could clarify the relation 
between the clustering of sources and the overdensities in which 
massive objects are likely to be embedded. 

It is important to note that the LOS toward SDSSJl 148-1-5251 con- 



Glimpsing through the high redshift neutral hydrogen fog 9 



t- 4 



1 1 


1 


1 


1 1 


1 


1 1 


1 


1 


1 


1 


1 


1 ' 


1 . 




















































i 














■■■1 


I., 










1 .., 








1 


1 , , , 


1 


1 


1 


• 

1 


^ 

1 


[11^ 
ifitV 

1 1 


t8 
1 

it 

1 


+ 5J 
bub 
bu 

1 1 


351 . 
ble . 
bble_ 

, , r 



2 4 6 8 10 
R (Mpc) 



Figure 7. Evolution of the optical depth r as a function of the distance R 
from QSOl . Filled circles denote the observed mean value for r, while error 
bars represent the maximum and the minimum observed t at a given dis- 
tance from the foreground QSO. Solid (dotted) magenta lines are the mean 
(maximum/minimum) values from 500 simulated LOS, computed adopting 
the case "with bubble". The dashed cyan horizontal line shows the mean 
optical depth predicted by the ERM in con'espondence of the emission red- 
shift of the foreground QSO. The dotted cyan horizontal lines denote the 
maximum/minimum optical depth at the same redshift. 

tributes to the LPW distribution with the smallest peak (Pmax ~ 
2A) in the entire sample. Thus, even if we succeeded in reproduc- 
ing the features of this LOS with our model "with bubble", we still 
have to explain the mysterious origin of transmissivity windows as 
large as 10 — 15A. 

In Sec. 4.1, we estimate the QSOl luminosity required to explain 
the observed Pmax value, and we comment on the result depen- 
dence from ft- Plugging in eq.(4) the value ft « 0.03 computed 
inside the proximity region, we obtain an effective size for Rhii', 
by further using eq.(2), this translates into A'^^ = 9.2 x 10^^ s^^, a 
value in quite good agreement with the QSOl ionizing rate quoted 
by Mahabaletal. 2005. 



6 DISCUSSION 

We have studied several statistical properties of the transmitted flux 
in high-2; QSO spectra and compared them with those obtained 
from simulated Lya forest spectra to infer constraints on the ioniza- 
tion state of the IGM at z ~ 6. We have considered two different 
reionization models: (i) an Early Reionization Model (ERM), in 
which the universe reionizes at Zrei = 6, and (ii) a Late Reioniza- 
tion Model (Zrei ~ 7). 

By first using standard control statistics (mean transmitted flux evo- 
lution, probability distribution function of the transmitted flux, gap 
width distribution) in the redshift range 3.5 < 2: < 6, we show 
that both ERM and LRM match the observational data. This im- 
plies that current observations do not exclude that reionization can 
have taken place at redshift well beyond six. 



We then apply the Largest Gap Width (LGW) and Largest Peak 
Width (LPW) statistics introduced by Gallerani et al. (2006) to a 
sample of 17 QSOs in the redshift range 5.74 6.42. Both ERM 
and LRM provide good fits to the observed LGW distribution, fa- 
voring a scenario in which a^ni smoothly evolves from 10~*'^ at 
z ^ 5.3 to lO""-^ alz^ 5.6. 

Discriminating among the two reionization scenarios would 
require a sample of QSO at even higher redshifts. In fact, although 
according to LRM at z > 6 the reionization process is still in the 
overlap phase with a mixture of ionized and neutral regions char- 
acterizing the IGM, only ~ 10% of the simulated LOS pierce the 
overlap epoch, and for a redshift depth Az < 0.2. This explains 
why the predicted LGW distributions are quite similar for the two 
models considered. 

Nonetheless, ERM provides a slightly better fit to observational 
data with respect to LRM, favoring Zrei ^ 7. Within the statistical 
relevance of our sample, we have shown that LRM models can be 
used to put a robust upper limit xm < 0.36 at z = 6.3. 

We have suggested that peaks preferentially arise from under- 
dense regions of the cosmic density field and also from isolated HII 
regions produced by either faint quasars or galaxies. The frequency 
of the observed peaks implies that the dark matter halos hosting 
such sources is relatively large, f» 10^^ (10^^) Mq. Bright QSOs 
are unlikely to contribute significantly in terms of peaks, because 
given the required size of the HII regions, they should be located 
close enough to the LOS to the target QSO, that they should be 
detectable in the field. 

The previous conclusions are substantiated by the specific 
case of an intervening HII region produced by the faint quasar RD 
J1I48-I-5253 (QSOl) al z = 5.70 along the LOS toward the high- 
est redshift quasar currently known (SDSS JI 148-1-5251, QS02) at 
z — 6.42. It is worth noting that searches for the transverse prox- 
imity effect in the HI Lya forest at 2 ~ 3 (Schirber et al. 2004) 
have been so far unsuccessful. Such effect has been isolated only by 
Hell absorption studies (Worseck & Wisotzki 2006;Worseck et al. 
2007). Thus, our results represent the first-ever detection in the HI 
Lya forest. We have analyzed the proximity effect of QSOl on the 
QS02 spectrum. Moreover, we have build up a simple model to 
estimate the location/extension of the proximity zone. Within the 
proximity region of QSOl we have found an increased number of 
peaks per unit frequency with respect to segments of the LOS lo- 
cated outside the quasar HII bubble. This supports the idea that we 
are indeed sampling the proximity region of the QSOl and that 
at least some peaks originate within ionized regions around (faint) 
sources. We then obtain a strong lower limit on the foreground QSO 
lifetime of tg > 11 Myr. Proper inclusion of galaxy clustering, 
which requires numerical simulations, might affect our conclusions 
(Faucher-Giguere et al. 2007). Note that even in this clear-cut case, 
the size of the largest observed peak in the spectrum of QS02 is 
only of 2 A. 

Thus we are left with the puzzling discrepancy between ob- 
served and simulated transmissivity windows (peaks) size, the for- 
mer being systematically larger. Very likely, this reflects an unwar- 
ranted assumption made by the model. We do not believe that the 
discrepancy could be impute to the assumption of a Log-Normal 
model, tested against HYDROPM simulations by GCF06. Never- 
theless, we plan to compare the observed Largest Peak Width dis- 
tribution with full hydrodynamical simulations in a future work to 
study the correlation properties of the underdense regions, since 
Coles et al. (1993) have shown that the Log-Normal model pro- 
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duces a too "clumpy" distribution of tiie density field, when com- 
pared with N-body simulations. 

At least two physical effects, neglected here, could affect the 
calculation of xhi: (i) non-equilibrium photoionization, and (ii) UV 
background radiation fluctuations. 

The first assumption is made by the majority of studies dealing with 
the Lya forest. However, if a fraction of the Lya forest gas has 
been shock-heated as it condenses into the cosmic web filaments, it 
might cool faster than it recombines. For example, the recombina- 
tion time tr becomes longer than the Hubble time when the density 
contrast is A < 7.5[(l-|-2;)/6.5]~^^^; hence, large deviations from 
photoionization equilibrium are expected where A <C 1. Lower 
values of xm with respect to equilibrium are expected in such re- 
gions, as a result of the exceedingly slow recombination rates. 

The second possible explanation for the too narrow simulated 
peaks might reside in radiative transfer effects, also neglected here. 
At 2 « 6 the increase in the mean GP optical depth is accompanied 
by an evident enhancement of the dispersion of this measurement 
which has been ascribed to spatial fluctuations of the UVB intensity 
near the end of reionization. A considerable (up to 10%) scatter in 
the UVB HI photoionization rate is expected already at a « 3, as 
shown by Maselli & Ferrara 2005 through detailed radiative trans- 
fer calculations. The amplitude of such illumination fluctuations 
tend to increase with redshift because of the overall thickening of 
the forest. Although the observed dispersion in the mean GP optical 
depth may be compatible with a spatially uniform UVB (Liu et al. 
2006; Lidz et al. 2006), it is likely that a proper radiative transfer 
treatment becomes mandatory at earlier times. Basically, the main 
effect of fluctuations is to break the dependence of the HI neutral 
fraction on density. This is readily understood by considering two 
perturbations with the same density contrast A. If the first is close 
to a luminous source it will have its xm depressed well below that 
of the second one located away from it. Thus, opacity fluctuations 
naturally arise. If so, peaks of larger width could be produced if the 
density perturbation associated with it happens to be located in a 
region where the UVB intensity is higher than the mean. 
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7 APPENDIX A 

The gap/peak statistics are sensitive to the S/N ratio, since spurious 
peaks could arise in spectral regions with noise higher than the flux 
threshold {Fth) adopted. In what follows we restrict our attention 
to "gaps", since the extension of the conclusions on the "peaks" is 
direct. In particular, in this Appendix, we discuss the LGW distri- 
bution shape dependence on the Fth chosen. We consider two val- 
ues for Fth, namely 0.03 and 0.08, which correspond to r = 3.5 
and r = 2.5, respectively. It is not obvious what criterion to apply 
in order to choose a proper value for Fth, since a too high (low) 
Fth could overestimate (underestimate) the gap length. In Fig. 8 
we show two examples of spectra in which the Fth choice strongly 
affects the gap measurement. On the bottom, the spectrum of QSO 
J1030-I-0524 is shown, and, in the small box, the region marked by 
the solid black line is zoomed. It is evident that Fth ~ 0.03 would 
break the gap at \rf ~ 1190 A, instead of at Xrf ~ II6OA, as 
also noticed by F06; thus, in this case Fth ~ 0.08 seems to be a 
better choice. 

The opposite is true for the spectrum of QSO Jl 148-1-5251, shown 
in Fig. 8 on the top. Indeed, in this case Fth ~ 0.08 would pro- 
vide a gap as large as ~ lOOA, terminated by transmission at 
Xrf ~ llOOA. However, the peak at Xrf ~ II6OA is consistent 
with pure transmission (White et al. 2003; Oh & Furlanetto 2005; 
F06); thus, in this case, Fth = 0.03 would provide the correct gap 
measurement. For this reason, we compute the LGW distribution, 
considering both Fth = 0.03 and Fth = 0.08, alternatively. 
The final LGW distribution is obtained as the mean of the prelimi- 
nary ones, weighted on the corresponding errors. 



8 APPENDIX B 

The redshift quoted by Mahabal et al. 2005 for RD J1I48-I-5253 
(Zem ~ 5.7) is based on the peak of the Lya emission line. 
This standard approach typically overestimates the true redshift by 
« 0.05 (e.g. Goodrich et al. 2001). For this reason we adopt as 
fiducial value z^m = 5.65. As the object RD J1I48-I-5253 (QSOl) 
is a BAL QSO (Mahabal et al. 2005), its emission redshift can not 
be established with accuracy from the broad metal lines, thus re- 
maining uncertain. By comparing the QSOl absorption spectrum 
with a BAL composite, also Zem = 5.725 could be a plausible 
choice for the QSOl emission redshift (Willot C., private commu- 
nication). In this Appendix we repeat the analysis shown in Sec. 5 
considering different possibilities for the QSOl emission redshift. 
In Fig. 9 we compare the optical depth evolution as a function of 
the distance from QSOl obtained assuming z^^^^ — 5.65 (left 
panel) with the cases in which z^^'^^ = 5.70 (middle panel) and 
2?m°^ = 5.725 (right panel). In table 8 the results of the Peak 
Spectral Density (PSD) for the tree different choices of QSOl emis- 
sion redshift are shown, together with the wavelength interval AA 
where Pgf °' > Thi. 



: z,„=6.42 




1050 1100 1150 1200 

MA) 

Figure 8. Observed spectra of the QSO SDSS J 11 48+5251 (top panel) and 
SDSS J1030+0524 {bottom panel). The black line denote the largest dark 
gap, measured by assuming Fth = 0.08. In the small box the region in- 
terested by the largest dark gap is zoomed. The two black lines indicate 
Fth = 0.08 and Fth = 0.03. From the top (bottom) panel is evident that 
Fth = 0.08 (Fth = 0.03) overestimates (underestimates) the size of the 
largest dark gap. 
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Even though Fig. 9 shows that the observed optical depth evolution 
as a function of the distance from QSOl is better explained by our 
model assuming ^Sn*^^ = 5.65, this result does not rule out other 
choices of the QSOl emission redshift. The results shown in table 
8 confirm the the evidence of the transverse proximity effect, since 
the boost in the PSD moving from outside toward inside the bubble 
is present in all the three cases considered. 
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Figure 9. Same as Fig. 7, but for different clioices of foreground QSO emission redshift: Ze„i = 5.65 (leftmost panel); z^m = 5.70 (middle panel); 
Zem = 5.725 (rightmost panel). 



